# Efficient attention mechanism

Tweety 7b Dutch V24a

Tweety-7b-dutch is a foundation large language model specialized in Dutch, built on the Mistral architecture and optimized for Dutch text processing with a Dutch-specific tokenizer.

Large Language Model

Transformers Other

Mistral 7B Instruct V0.2 Sparsity 20 V0.1

Mistral-7B-Instruct-v0.2 is an instruction-fine-tuned large language model that improves on Mistral-7B-Instruct-v0.1. This variant is pruned to 20% sparsity with the Wanda pruning method and maintains competitive performance without retraining.

Large Language Model

Mpt 7b 8k Instruct

MPT-7B-Instruct-8k is a model for long-form instruction following, particularly suited to answering questions about and summarizing long documents.

Large Language Model

Transformers Other

Chinese Bigbird Base 4096

A Chinese pre-trained model based on the BigBird architecture, supporting context lengths of up to 4096 tokens.

Large Language Model

Transformers Chinese


© 2025 AIbase